Emergence of Vocal Developmental Sequences in a Predictive Coding Model of Speech Acquisition
Abstract
Speech acquisition in infants requires learning the temporal patterns among primitive speech sequences and controlling the motor apparatus to produce the learned patterns effectively. In this paper, we develop a predictive coding model whose objective is to minimize the sensory (auditory) and proprioceptive prediction errors. Temporal patterns are learned by minimizing the former, while control is learned by minimizing the latter. The model is trained on a set of synthetically generated syllables, as in other contemporary models. We show that the proposed model outperforms existing ones in learning vocalization classes. It also computes the control (muscle activation) signal, which is useful for determining how easy a vocalization is to produce.
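As a rough illustration of the objective described in the abstract, the two prediction errors can be combined into a single cost. The symbols below (auditory input a_t, proprioceptive input p_t, their predictions, the weight lambda, and the squared-norm form) are assumptions made for illustration and are not taken from the paper.

```latex
% Hypothetical combined cost: auditory error plus weighted proprioceptive error.
% a_t, p_t: observed auditory and proprioceptive signals at time t (assumed notation)
% \hat{a}_t, \hat{p}_t: the model's predictions of those signals (assumed notation)
% \lambda: relative weight of the proprioceptive term (assumed)
J = \sum_{t} \left( \lVert a_t - \hat{a}_t \rVert^2
      + \lambda \, \lVert p_t - \hat{p}_t \rVert^2 \right)
```

Under this reading, reducing the first term corresponds to learning temporal patterns, and reducing the second corresponds to learning control.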
Similar resources
Emergence of Vocal Developmental Sequences in Speech Acquisition using a Unified Model of Perception, Action and Learning
Learning to speak a language requires, at the very least, the ability to perceive speech from the noisy, uncertain and dynamic environment, establish the sensorimotor mapping between auditory and articulatory spaces, and control the vocal apparatus to produce a meaningful speech sound. Unearthing the underlying mechanisms will immensely help in language training and rehabilitation approaches to...
The role of intrinsic motivations in learning sensorimotor vocal mappings: a developmental robotics study
Learning complex mappings between various modalities (typically articulatory, somatosensory and auditory) is a central issue in computationally modeling speech acquisition. These mappings are generally nonlinear and redundant, involving high dimensional sensorimotor spaces. Classical approaches consider two separate phases: a relatively pre-determined exploration phase analogous to infant babbl...
Phase equalization-based autoregressive model of speech signals
This paper presents a novel method for estimating a vocal-tract spectrum from speech signals, based on a modeling of excitation signals of voiced speech. A formulation of linear prediction coding with impulse train is derived and applied to the phase-equalized speech signals, which are converted from the original speech signals by phase equalization. Preliminary results show that the proposed me...
Issues in clinical applications of bidirectional multi-step predictive analysis of speech
The topic of the presentation is an examination of several methodological problems posed by multi-step predictive analysis of speech when applied with a view to estimating vocal dysperiodicities. Problems that are discussed are the following. First, the stability of the multistep predictive synthesis filter; second, the decrease of the quantization noise by means of multiple prediction coeffici...
A Neural Network Model Of Speech Acquisition And Motor Equivalent Speech Production Running title: Speech acquisition and motor equivalence
This article describes a neural network model that addresses the acquisition of speaking skills by infants and subsequent motor equivalent production of speech sounds. The model learns two mappings during a babbling phase. A phonetic-to-orosensory mapping specifies a vocal tract target for each speech sound; these targets take the form of convex regions in orosensory coordinates defining the sh...
Publication date: 2016